# A Novel Small-Signal Knowledge-Based Neural Network Modeling Approach for Packaged Transistors

Shuxia Yan<sup>1,2</sup>, Xiaoyi Jin<sup>1,2</sup>, Yaoqian Zhang<sup>1,2</sup>, Weiguang Shi<sup>1,2</sup>, Jia Wen<sup>1,2</sup>

<sup>1</sup> School of Electronics and Information Engineering, Tianjin Polytechnic University, Tianjin, China <sup>2</sup> Tianjin Key Laboratory of Optoelectronic Detection Technology and System, Tianjin, China Corresponding Author: Weiguang Shi

**Abstract :** This paper proposes a novel small-signal knowledge-based neural network modeling method for packaged transistors. Separate neural networks are proposed to represent the behaviors of packages covering the core transistor. An advanced training method is developed by utilizing the different parameters to adjust the different characteristics of the packaged transistors, which avoid parameter adjustment repeatedly and speed up the modeling process. The proposed model combing the neural networks with the core transistor model is trained to present the entire small-signal behavior of the packaged transistors. Measurement data of the radio frequency (RF) power laterally diffused metal-oxide semiconductor (LDMOS) transistor are used as the application example to verify the capability of the proposed method. The results demonstrate that the proposed model is more accurate than existing models.

*Keywords* - small-signal model, transistors, neural network, modeling.

| Date of Submission: 17-09-2018 | Date of acceptance: 03-10-2018 |
|--------------------------------|--------------------------------|
|                                |                                |

## I. Introduction

With the development of electronic technology, the accurate computer-aided design (CAD) models of packaged transistors play a decisive role in the circuit/system design [1, 2]. The equivalent-circuit-based model [3] and Electromagnetic (EM) - based model [4] are good for modeling mature technologies and existing transistors. However, with the increasing design complexities and shorter design cycles, the conventional CAD approaches are difficult to satisfy the requirements of precision and speed at the same time. New semiconductor technologies and materials continue to evolve making it necessary to develop efficient modeling algorithms for packaged transistors.

Recently, Knowledge-based neural network modeling techniques have been recognized as useful alternatives to conventional technologies in microwave modeling [5-7]. The knowledge-based model exploit existing knowledge in the form of empirical or equivalent circuit models together with neural networks to develop a more accurate model. The evaluation from input to output of a knowledge-based model is also very fast. Knowledge-based techniques have been utilized in transistors modeling when the mathematical model is not available [8]. However, the existing knowledge-based neural network methods for transistors modeling mainly focus on the core transistor without modeling the package circuit [9, 10]. Systematic and fast modeling methods for packaged transistors are still an open research topic.

In this paper, a novel small-signal modeling method using knowledge-based neural networks for packaged transistors is proposed. Separate neural networks are adopted to represent the nonlinear relationship between the frequency and the S-parameters. An advanced training method is proposed for the model development. The proposed model can match the device well and exceed the current capabilities of existing device models.

## II. Proposed Modeling Method for Packaged Transistors

Packages of transistors typically contain a metal flange and a dielectric window frame. The core transistor is bonded to the die-bond area inside the cavity of the window frame. Metal leads are provided at the input and output sides of the window frame to allow for connection to external circuitry. Based on the physical structure of the packaged transistor, we propose to divide the total structure into three parts: the input package circuit, the core transistor circuit and the output package circuit, and create the CAD modules for these three parts respectively.

## 2.1. Proposed DC model

In general, the packaged circuit is composed of linear devices, which does not affect the DC characteristics of the device. The DC characteristics of the device are affected only by the core circuit. In this

paper, we proposed to use the modeling method in literature [9] to create the core circuit model. We define the knowledge model in this paper to represent the existing transistor model. Because the DC characteristics of the existing transistor model and that of the core circuit are not the same, we propose to establish a mapping network to map the inputs of the knowledge model onto the core circuit. Because the mapping network is nonlinear and unknown, neural networks (ANN<sub>1</sub>) are proposed to be use as the mapping network. The proposed DC model contains the knowledge model and the input mapping network, shown in Fig.1. After training the neural networks, the proposed DC model can represent the behaviors of the core circuit. Let  $V_s^c = [V_{gs}^c, V_{ds}^c]^T$  and  $I_s^c = [I_{gs}^c, I_{ds}^d]^T$  represent the voltage and current signals of the knowledge model respectively. Let  $V_s^f = [V_{gs}^f, V_{ds}^f]^T$  and  $I_s^f = [I_{gs}^f, I_{ds}^d]^T$  represent the voltage model current capabilities by adding more free variables. When the knowledge model operates with the signals ( $V_{gs}^c, V_{ds}^c$ ) instead of the signals ( $V_{gs}^f, V_{ds}^f$ ), the output current of the knowledge model  $I_{ds}^c$  can match that of the modeled device  $I_{ds}^f$  accurately. The neural network is used to describe the nonlinear relationship between the signals of the knowledge model ( $V_{gs}^c, V_{ds}^c$ ) and the signals of the modeled device ( $V_{gs}^f, V_{ds}^f$ ) as

$$(V_{gs}^c, V_{ds}^c) = f_{ANN}(V_{gs}^f, V_{ds}^f, \mathbf{w}_1)$$
<sup>(1)</sup>

where  $f_{ANN}$  represents a multilayer feedforward neural network, and  $\mathbf{w}_1$  is a vector containing all internal synaptic weights in the neural network  $f_{ANN}$ .



Figure.1 DC model structure.

### 2.2. Proposed S-parameters model structure

In the packaged transistors, both the core circuit and the package circuits affect the small-signal characteristics of the device. When the core circuit and the package circuits are modeled in terms of their scattering parameters, scattering-matrix analysis can be applicable to small-signal modeling of the packaged transistor. We proposed to create the small signal model for the input package circuit, the core circuit and the output package circuit respectively, and calculate the *S*-parameters for the model device. The structure of the proposed small signal model is shown in Figure 2. The core module consist of the knowledge model and a neural network  $ANN_1$  represent the small-signal characteristics of the core circuit. The core module can ensure the DC as well as the *S*-parameters characteristics. Two neural networks are proposed to represent the behaviors of the input and output package circuits respectively. The packaged module can be achieved only using the terminal signals, instead of the internal and physical structure information of the transistor. The *S*-matrix module based on the literature [5] is constructed realizing the calculation of *S*-parameters between the packaged transistor and its three parts.



Figure.2 Proposed small-signal model structure.

In Figure 2, the modules ANN<sub>2</sub> and ANN<sub>3</sub> respectively represent the performance of the input/output package circuits which consist of passive components such as bond wires, MOS capacitors and so on. Because the input/output package circuit consists of linear components, the unique input of the packaged modules is the frequency. The output signals of the ANN<sub>2</sub>  $\operatorname{Re}(S_{ij}^{I})$  and  $\operatorname{Im}(S_{ij}^{I})$  are the real and imaginary parts of  $S_{11}, S_{12}$  and  $S_{22}$  of the input packaged circuit. Similarly,  $\operatorname{Re}(S_{ij}^{O})$  and  $\operatorname{Im}(S_{ij}^{O})$  are the output signals of the output package circuit respected by the ANN<sub>3</sub>. For the core module, bias voltages and frequency are the input signals, and the real and imaginary parts of *S*-parameters  $\operatorname{Re}(S_{ij}^{C})$  and  $\operatorname{Im}(S_{ij}^{O})$  are the output signals. The *S*-parameters of the modeled device  $\operatorname{Re}(S_{ij}^{f})$  and  $\operatorname{Im}(S_{ij}^{O})$  can be calculated with *S*-matrix calculation module. In the proposed model, the packaged modules represent the nonlinear relationship between the frequency and the *S*-parameters, which can be described as

$$(\operatorname{Re}(S_{ij}^{I}), \operatorname{Im}(S_{ij}^{I})) = h_{ANN}(freq, \mathbf{w}_{2})$$

$$\tag{2}$$

$$(\operatorname{Re}(S_{ij}^{O}), \operatorname{Im}(S_{ij}^{O})) = g_{ANN}(freq, \mathbf{w}_{3})$$
(3)

where  $h_{ANN}$  and  $g_{ANN}$  represents multilayer feedforward neural network, and  $\mathbf{w}_2$  and  $\mathbf{w}_3$  are vectors containing all internal synaptic weights in the neural network  $h_{ANN}$  and  $g_{ANN}$  respectively.

Usually, the mathematical relationship between the frequency and the *S*-parameters of the packaged circuit is not available. When more bond wires, MOS capacitors and integrated capacitor are added into the package circuit to ease external matching-circuit design, the relationship between the frequency and the *S*-parameters is more complicated. The proposed small-signal model with high precision and high speed can be achieved only using the terminal signals, instead of the internal and physical structure information of the transistor. To make the proposed knowledge-based neural network model represent the DC and small-signal characteristics of the actual device, we propose a new training method in the next subsection.

#### 2.3. Proposed training method

A knowledge-based neural network model cannot represent the packaged transistor accurately until it learns the related data. Therefore, the neural network training is an important step during the model development. The training process automatically adjusts the weights in the neural network so that the output of the model can fit the device data accurately. The training error represents the difference between the device data and the model. Equation (4) and (5) represent the training error of DC, and *S*-parameters characteristics, respectively:

$$E(\boldsymbol{w}) = \frac{1}{2} \sum_{n=1}^{N} ||I(V_{gs}^{f,n}, V_{df}^{f,n}, \boldsymbol{w}_{1}) - I_{D}^{n}||^{2}$$
(4)

$$E(\mathbf{w}) = \frac{1}{2} \sum_{n=1}^{N} ||S(V_{gs}^{f,n}, V_{ds}^{f,n}, freq^{n}, \mathbf{w}_{2}, \mathbf{w}_{3}) - S_{D}^{n}||^{2}$$
(5)

where  $I_D$  and I(.) represent the DC responses of the packaged transistor data and the proposed model, respectively. The superscript *n* represents the training data index, and *N* represents the total number of the training data.  $S_D$  and S(.) represent the *S*-parameters of the packaged transistor data and the proposed model, respectively.

In order to improve the modeling efficiency, we propose a four-stage training method. In the first stage, we initialize the weight value of the ANNs avoiding the proposed model degrading the knowledge model performance. In the second stage, we adjust the weight  $\mathbf{w}_1$  of the neural network in Figure 1 making the DC model match the device data in the DC simulation. In the third stage, we adjust the weights  $\mathbf{w}_2$  and  $\mathbf{w}_3$  of the neural networks in Figure 2 making the proposed small-signal model match the device data in the *S*-parameters simulation. In the fourth stage, we train the proposed overall model by simultaneously train the DC model and the small-signal model again to finally achieve the modeling accuracy. The proposed method controls the DC and small-signal performance of the model with different weight parameters, which reduce the mutual interference of the optimized parameters and avoid changing the optimized parameters repeatedly. After training, the proposed model can be more accurate than the existing model, and it can replace the actual device to plug into an original circuit for design and simulation. The details of the steps for the proposed training process are shown as follows:

Step 1) Initialize the ANN<sub>1</sub> by solving  $V_{gs}^c = V_{gs}^f$  and  $V_{ds}^c = V_{ds}^f$ , and obtain the initial variables  $w_1^0$ . Initialize the ANN<sub>2</sub> by solving Re( $S_{12}^I$ ) = Re( $S_{21}^I$ ) = 1 and other S-parameters equals 0, and obtain the initial

variables  $w_2^0$ . Initialize the ANN<sub>3</sub> by solving  $\operatorname{Re}(S_{12}^o) = \operatorname{Re}(S_{21}^o) = 1$  and other S-parameters equals 0, and obtain the initial variables  $w_3^0$ . This step can avoid degrading the knowledge model performance.

- Step 2) Adjust the weight  $w_1^0$  to  $w_1^*$  by solving the equation (4) and obtain the bias voltage of the knowledge model  $V_{gs}^c$  and  $V_{ds}^c$ , which make the proposed model match the device data in the DC simulation.
- Step 3) Adjust the weights  $w_2^0$  to  $w_2^*$  and  $w_3^0$  to  $w_3^*$  by solving the equation (5) and obtain the  $\operatorname{Re}(S_{ij}^I)/\operatorname{Im}(S_{ij}^I)$  and  $\operatorname{Re}(S_{ij}^O)/\operatorname{Im}(S_{ij}^O)$ , which make the proposed small-signal model match the device data in the *S*-parameter simulation.
- Step 4) Fine tune the weights  $(w_1^*, w_2^*, w_3^*)$  to  $(w_1^{\#}, w_2^{\#}, w_3^{\#})$  making the training error as small as possible, which can improve the performance of the proposed model further.

#### **III.** Experimental Verification

In this experiment, measured data of the laterally diffused metal-oxide semiconductor (LDMOS) packaged transistor AFT18S230 are used as the training data and test data. The range of them used in this example is showed in Table 1. The LDMOS transistor AFT18S290 model in Advanced Design System (ADS) is used as the knowledge model. The mismatch between the knowledge model and the measured data cannot be ignored. The proposed model is trained using the proposed four-stage training method which has been introduced in section 2.3. The proposed model learns the training data by automatically adjusting the weight of the neural networks. Test data which are different with the training data are used to validate the accuracy of the constructed model. Table 2 gives the test error of the knowledge model and the proposed model. In order to further show the detailed results, the I-V and *S*-parameters comparison between the measured data and the models are shown in Fig.3 and Fig.4, respectively. Good agreements between the proposed model and the measured data can be observed.

| Table.1 Training data and test data for DC and S-parameters modeling. |               |                      |                                |               |  |  |
|-----------------------------------------------------------------------|---------------|----------------------|--------------------------------|---------------|--|--|
|                                                                       |               | $V_{gs}(\mathbf{V})$ | $V_{ds}\left(\mathrm{V} ight)$ | freq (GHz)    |  |  |
| DC<br>Simulation                                                      | Training Data | 2.55:0.1:3.25        | -0.5:1:32.5                    |               |  |  |
|                                                                       | Test Data     | 2.6:0.1:3.2          | 0:1:32                         |               |  |  |
|                                                                       |               | 1.35:0.1:1.45        |                                | 1 = 0 0 = 0 1 |  |  |

2.68:0.05:2.78

1.4,2.7,2.75

Training Data

Test Data

| <b>Table.2</b> Test error of the models for DC and <i>S</i> -parameters simulation. |          |              |                             |                    |              |                    |              |                    |              |
|-------------------------------------------------------------------------------------|----------|--------------|-----------------------------|--------------------|--------------|--------------------|--------------|--------------------|--------------|
| Error (%)                                                                           | $I_{ds}$ | $Re(S_{11})$ | $\operatorname{Im}(S_{11})$ | ${\rm Re}(S_{12})$ | $Im(S_{12})$ | ${\rm Re}(S_{21})$ | $Im(S_{21})$ | ${\rm Re}(S_{22})$ | $Im(S_{22})$ |
| Knowledge<br>Model                                                                  | 6.7      | 6.8          | 8.1                         | 12.3               | 12.6         | 8.2                | 9.5          | 28.6               | 31.9         |
| Proposed<br>Model                                                                   | 0.6      | 0.9          | 1.2                         | 3.5                | 3.4          | 1.7                | 1.5          | 1.9                | 1.2          |



Figure.3 I-V comparison between the measured data and the models for the LDMOS transistor.

**S-Parameters** 

Simulation

26.5:1:29.5

28

1.7:0.05:3.1

1.7:0.05:3.1



**Figure.4** Comparison of *S*-parameters between the measured data and the models for the LDMOS transistor at the typical work bias point ( $V_{gs} = 2.75V, V_{ds} = 28V$ ).

In this example, the proposed model are operated in hamonic balance (HB) simulation to further verify the effectiveness of the advanced modeling methodology. The models work at the fundamental frequency  $1.805GH_Z$ , the source impedance  $1.535 - j4.232\Omega$ , the load impedance  $1.403 - j3.748\Omega$ , the bias voltage (2.75V, 28V) and the different input powers ( $P_m$ : from 4.5 to 16.5dBm, step 2dBm). The comparison results of the gain and the power added efficiency (PAE) between the measured data and the models are shown in Fig.5, demonstrating that the HB respons of the proposed model is much closer to the measured data than that of the knowledge model. This result provides a good foundation for the large signals modeling in the future work.



Figure.5 Comparison of the gain and PAE between the measured data and the models for the LDMOS transistor.

## IV. Conclusions

In this paper, the novel model combing the separate neural networks with the core transistor model is proposed to present the entire small-signal behavior of the packaged transistors for the first time. The advanced training method can find the appropriate parameters efficiently, and the trained model can represent the linear characteristic of the practical device accurately. The proposed model is achieved only using the terminal signals, instead of the internal and physical structure information of the packaged transistor. This advantage makes the novel method suitable for more and more complex devices meeting the demand of modern technology development.

#### Acknowledgements

This research was funded by [the National Natural Science Foundation of China] grant number [61601323]; [the Scientific Research Project of Tianjin Education Commission] grant number [2017KJ088]; [the Tianjin Natural Science Foundation] grant number [17JCQNJC01400].

#### References

- Rudolph M, Fager C, Root D E, Nonlinear Transistor Model Parameter Extraction Techniques (UK: Cambridge University Press, 2012).
- [2] P. K. Singya, N. Kumar, V. Bhatia, Mitigating NLD for wireless networks: effect of nonlinear power amplifiers on future wireless communication networks, IEEE Microwave Magazine, 18(5), 2017, 73-90.
- [3] P. H. Aaen, J. A. Plá, J. Wood, Modeling and Characterization of RF and Microwave Power FETs (UK: Cambridge University Press, 2007).
- [4] L. Dan, Z. Lei, H. Rueda, et al, Device Physics and EM Simulation Based Modeling Methodology for LDMOS RF Power Transistors, IEEE Mtt-S International Conference on Numerical Electromagnetic and Multiphysics Modeling and Optimization for Rf, Microwave, and Terahertz Applications. IEEE, 2017, 79-81.
- [5] Q. J. Zhang, K. C. Gupta, Neural Networks for RF and Microwave Design (Boston : Artech House, 2000).
- [6] H. Kabir, L. Zhang, M. Yu, Smart modeling of microwave devices, IEEE Microwave Magazine, 11(3), 2010, 105-118.
- [7] W. C. Na, Feng F, C. Zhang, A Unified Automated Parametric Modeling Algorithm Using Knowledge-Based Neural Network and l<sub>1</sub> Optimization, IEEE Transactions on Microwave Theory & Techniques, 65(3), 2017, 729-745.
- [8] L. Zhang, J. J. Xu, M. C. E. Yagoub, Efficient analytical formulation and sensitivity analysis of neuro-space mapping for nonlinear microwave device modeling, IEEE Transactions on Microwave Theory and Techniques, 53(9), 2005, 2752-2767.
- [9] L. Zhu; Q. J. Zhang; K. H. Liu. A novel dynamic neuro-space mapping approach for nonlinear microwave device modeling. IEEE Microwave & Wireless Components Letters, 26(2), 2016, 131-133.
- [10] L. Zhu; J. Zhao; Z. Li. A general neuro-space mapping technique for microwave device modeling. Eurasip Journal on Wireless Communications & Networking, 2018(1), 2018, 37.

IOSR Journal of Electrical and Electronics Engineering (IOSR-JEEE) is UGC approved Journal with Sl. No. 4198, Journal no. 45125.

Shuxia Yan "A Novel Small-Signal Knowledge-Based Neural Network Modeling Approach for Packaged Transistors" IOSR Journal of Electrical and Electronics Engineering (IOSR-JEEE) 13.5 (2018): 40-45.

DOI: 10.9790/1676-1305014045

\_\_\_\_\_